A Unit Selection-based Speech Synthesis Approach for Mandarin Chinese

نویسندگان

  • Minghui Dong
  • Kim-Teng Lua
  • Haizhou Li
چکیده

The paper presents a unit selection-based speech synthesis approach for mandarin Chinese. Unit selection-based approach generates speech by selecting proper units from a speech corpus and connecting them together. In this approach, a set of features are defined to describe the speech units in the corpus and the expected units in the synthesized utterance. Based on the features, cost function is defined to select a sequence of units that are able to generate high quality speech. The cost function describes the two aspects of the generated speech, ie. (1) the appropriateness level of each unit itself. (2) the smoothness level between two units to be concatenated. Viterbi search algorithm is used in this approach to find the best unit sequence that minimizes the two costs. The authors use a new prosody description to ensure the prosody quality of the generated speech. Experiment shows that this approach can generate very high quality speech.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Unit Selection-based Speech Synthesis Approach for Chinese Mandarin Text-to-Speech

The paper presents a unit selection-based speech synthesis approach for Chinese Mandarin. Unit selection-based approach generates speech by directly connecting pre-recorded speech units. In this approach, a corpus is used as a source unit inventory. A feature vector is defined to describe each unit. To generate speech, the feature vector of the target unit is first calculated. During synthesis,...

متن کامل

Study on Unit-Selection and Statistical Parametric Speech Synthesis Techniques

One of the interesting topics on multimedia domain is concerned with empowering computer in order to speech production. Speech synthesis is granting human abilities to the computer for speech production. Data-based approach and process-based approach are the two main approaches on speech synthesis. Each approach has its varied challenges. Unit-selection speech synthesis and statistical parametr...

متن کامل

Issues in Text-to-Speech Conversion for Mandarin

Research on text-to-speech (TTS) conversion for Mandarin Chinese is a much younger enterprise than comparable research for English or other European languages. Nonetheless, impressive progress has been made over the last couple of decades, and Mandarin Chinese systems now exist which approach, or in some ways even surpass in quality available systems for English. This article has two goals. The...

متن کامل

A novel hybrid approach for Mandarin speech synthesis

The paper investigates a new method to solve concatenation problems of Mandarin speech synthesis which is based on the hybrid approach of HMM-based speech synthesis and unit selection. Unlike other works which use only boundary F0 errors as concatenation cost, a CART based F0 dependency model which considers much context information is trained to measure smoothness of F0. Instead of phoneme-siz...

متن کامل

The WISTON Text to Speech System for Blizzard Challenge 2010

The paper introduces the speech synthesis system developed by Institute of Automation, Chinese Academy of Sciences(CASIA) for Blizzard Challenge 2010. The large corpus based speech synthesis system, WISTON, was built to synthesize Mandarin speech. In this year, a new prosodic structure prediction model was used, which is more precise and compact than before. Furthermore, two kinds of syllable s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of Chinese Language and Computing

دوره 16  شماره 

صفحات  -

تاریخ انتشار 2006